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The present invention relates to the field of nucleic acid sequence analysis. The invention provides methods for performing a 
polymerase signal assay (PSA) to analyze nucleotide sequences using solid phase sequence arrays comprising a plurality of sequence 
reagents with primer sequences 4-6 bases in length. The methods of the invention generate a binary signal pattern which can be used 
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can be identified by the methods of the invention include single nucleotide polymorphisms (SNP's), base deletions, base insertions, and 
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POLYMERASE SIGNALING ASSAY 

1. FIELD OF THE INVENTION 
5 The present invention relates to the field of nucleic 

acid sequence analysis. Specifically, the invention provides 
methods of nucleic acid sequence analysis which use 
combinatorial sequence array primers to sequence and/or 
detect mutations or polymorphisms within a template nucleic 
acid. 

10 

2 . BACKGROUND OF THE INVENTION 
This invention relates to the field of nucleic acid 
sequence analysis. The analysis of nucleic acid sequences 
can be used, e.g.,- to determine the presence or absence of a 

15 particular genetic element. Variant genetic elements of a 
nucleic acid sequence usually exist. Exemplary variant 
genetic elements may include, but are by no means limited to, 
genetic mutations or polymorphisms such as single nucleotide 
polymorphisms ("SNP's"), base deletions, base insertions, and 

20 heterozygous as well as homozygous polymorphisms. 

Accordingly many techniques have been developed to compare 
homologous segments of nucleic acid sequence to determine if 
the segments are identical or if they differ at one or more 
nucleotides. Practical applications of these techniques 

^ include genetic disease diagnoses, infectious disease 

diagnoses, forensic techniques, paternity determinations, and 
genome mapping. 

In general, the detection of nucleic acids in a sample 
and of the subtypes thereof depends on the technique of 
specific nucleic acid hybridization in which the 

30 oligonucleotide probe is annealed under conditions of high 
stringency to nucleic acids in the sample, and the 
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successfully annealed probes are subsequently detected (see, 
e.g., Spiegelman, S., 1964, Scientific American 210:48). 

The most definitive method for comparing DNA segments is 
to determine the complete nucleotide sequence of each 
5 segment. Examples of how sequencing has been used to study 
mutations in human genes are included in the publications of 
Engelke et al. (1988, Proc. Natl. Acad. Sci. U.S.A. 85:544- 
548) and Wong et al. (1987, Nature 330:384-386). The most 
commonly used methods of nucleic acid sequencing include the 
dideoxy-mediated chain termination method, also known as the 

10 

"Sanger Method" (Sanger, F. et al., 1975, J. Molec. Biol. 
94:441; Porbe, J. et al . , 1987, Science 238:336-340) and the 
chemical degradation or "Maxam-Gilbert " method (Maxam, A.M. 
et al. # 1977, Proc. Natl. Acad. Sci. U.S.A. 74:560). 

Both the Sanger and Maxim-Gilbert methods comprise a 

15 series of four chemical reactions, one for each of the 

nucleotide bases, e.g., A, C, G, and T for DNA, consisting of 
either primer extension (Sanger) or partial cleavage (Maxim- 
Gilbert) reactions. The reactions produce four sets of 
nested nucleic acid molecules whose lengths are determined by 

20 the location of a particular base along the length of the 
nucleic acid molecule being sequenced. The nested reaction 
products are then resolved by electrophoretic gels. 

The separation and analysis of reaction products on 
electrophoretic gels is a laborious and time consuming step. 

2g Accordingly, alternative methods have been developed to 
sequence nucleic acid molecules. For example, there is 
considerable interest in developing methods of de novo 
sequencing using solid phase arrays (see, e.g., Chetverin, A. 
B. et al., 1994, Bio/Technology 12:1093-1099; Macevicz, U.S. 
Patent No. 5,002,867; Beattie, W.G. et al., 1995, Molec. 
Biotech. 4:213-225; Drmanac, R.T., EP 797683; Gruber, L.S., 
EP 787183; each of which is incorporated herein by reference 
in its entirety) . These methods consist primarily of 
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hybridization of template nucleic acids to arrayed primers 
containing combinatorial sequences which hybridize to 
complementary sequences on the template strand. The methods 
combine the capture of the template, by formation of stable 
5 duplex structures, with sequence discrimination due to 
instability of mismatches between the template and the 
primer . 

Such methods must typically employ arrays of primers at 
least twelve bases in length which contain approximately 16 
million sequence combinations. Such arrays are very complex 
and time consuming both to construct and to analyze. Thus, 
at the present time it is not practical to use extensive 
sequencing methods, such as the methods described above, to 
compare more than just a few DNA segments because the effort 
required to determine, interpret, and compare complete 

15 sequence information is time-consuming. 

Restriction fragment length polymorphism ( " RFLP " ) 
mapping is another commonly used screen for DNA polymorphisms 
arising from DNA sequence variation. RFLP consists of 
digesting DNA with restriction endonucleases and analyzing 

20 the resulting fragment by means of Southern blots, as 
described by Botstein et al . , 1980 {Am. J. Hum. Genet. 
32:314-331) and White etal. (1988, Sci . Am. 258:40-48). 
Mutations that affect the recognition of sequence of the 
endonuclease will preclude enzymatic cleavage at that site, 
thereby altering the cleavage pattern of the DNA. DNAs are 
compared by looking for differences in restriction fragment 
lengths. However, a major problem with RFLP mapping is its 
inability to detect mutations that do not affect cleavage 
with a restriction endonuclease. Thus, many mutations are 
missed with this method. Further, the methods used to detect 

30 restriction fragment length polymorphisms are very labor 

intensive, particularly the techniques involved with Southern 
blot analysis. 



- 3 - 



WO 99/66076 



PCT/US99/13430 



Alternative, simpler methods have been developed which 
use solid phase arrays to analyze single nucleotide 
polymorphisms (SNP's). These techniques rely on the fact 
that analysis of SNP's, which constitute sites of variation 
5 flanked by regions of invariant sequence, requires no more 
than the determination of the identity of the single 
nucleotide present at the site of variation. 

For example, several primer-guided nucleotide 
incorporation procedures for assaying polymorphic sites in 
DNA have been described (e.g., Komher, J.S. et al., 1989, 

10 

Nucl. Acids Res. 177779-7784; Sokolov, B.P., 1990, Nucl . 
Acids Res. 18:3671; Syvanen, A.-C. et al . , 1990, Genomics 
8:684-692 ; Kuppuswamy, M.N. et al . , 1991, Proc. Natl. Acad. 
Sci. U.S.A. 88:1143-1147; Prezant, T.R. et al., 1992, Hum. 
Mutat. 1:159-164; Ugozzoli, L. et al . , 1992, GAT A 5:107-112; 

15 Nyren, P. et al., 1993, Anal. Biochem. 208:171-175; and 
Wallace WO89/10414) . Each of these methods relies on the 
incorporation of labeled deoxynucleotides to discriminate 
between bases at a polymorphic site. An alternative 
microsequencing method, the Genetic Bit Analysis (GBA™) 

20 method has been disclosed by Goelet, P. et al . (WO 92/15712) 
which avoids many of the problems in the above identified 
microsequencing assays. In GBA™, the nucleotide sequence 
information surrounding a predetermined site of interrogation 
is used to design an oligonucleotide primer that is 

25 complementary to the region immediately adjacent to, but not 
including, the predetermined site. The target DNA template 
is selected from the biological sample and hybridized to the 
interrogating primer. This primer is extended by a single 
labeled dideoxynucleotide using DNA polymerase in the 
presence of at least two, and most preferably all four chain 

30 

terminating nucleoside triphosphate precursors. 

Several variations of the GBA method have been 
developed, as well as other microsequencing methods (see, 
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e.g., Mundy, U.S. Patent No. 4,656,127; Vary and Diamond, 
U.S. Patent No. 4,851,331; Cohen, D. et al . , PCT Application 
No. WO91/02087; Chee, M. et al . , W095/11995; Landegren, U. et 
al., 1988, Science 241:1077-1080; Nicerson, D.A. et al . , 
5 1990, Proc. Natl. Acad. Sci . U.S.A. 87:8923-8927; Pastinen, 
T. et al. (1997, Genome Res. 7:606-614; Pastinen, T. et al . , 
1996, Clin. Chem. 42:1391-1397; Jalanko, A. et al . (1992, 
Clin. Chem. 38:39-43; Shumaker, J.M. et al . , 1996, Hum. 
Mutation 7:346-354; Caskey, C. et al., WO 95/00669). 
10 Although they are simpler to perform and analyze than de novo 
sequencing, such microsequencing methods require primers that 
hybridize to the target nucleic acid molecule at a site 
immediately adjacent to a polymorphism (or a site suspected 
of being next to a polymorphism) . Hence, such techniques 
require prior knowledge of a "wild type" nucleic acid 

15 

sequence. Further, the techniques are limited to identifying 
a specific mutation or polymorphisms, typically a SNP, at a 
specific location in a specific nucleic acid sequence. 
Finally, such techniques also typically require multiple 
interrogations per target base. 

20 

3 . SUMMARY OF THE INVENTION 
The present invention relates to a unique universal 

array system and its use for nucleic acid sequence analysis. 

The array system of the invention can be used for nucleic 
25 acid sequence analysis, for example to identify DNA and other 

nucleic acid sequences as well as mutations and polymorphisms 

in those sequences . 

The principles of the sequence analysis system and 

methods of the invention involve: (i) capture of the 

polynucleotide to be analyzed, referred to herein as the 

30 

"template" by a specific "capture moiety" at a particular 
"capture region" on the template, (ii) scanning of the 
template by a combinatorial primer-polymerase complex for 
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regions of complementarity, and (iii) signal generation 
through a template homology dependent primer extension 
reaction wherein the primer is extended by one or more 
nucleotides or nucleotide analogues. 
5 Because primer extension occurs only when a region of 

complementarity to the primer is encountered on the template, 
the extension signal generated in step (iii) is the key 
element that identifies the existence of a specific nucleic 
acid sequence in the template polynucleotide. If no sequence 
complementary to the combinatorial primer is present in the 
template, primer extension does not occur, and an extension 
signal is not generated. Thus, by performing the methods of 
the invention with an array comprising a plurality of 
combinatorial primers, a unique pattern or matrix of extended 
vs. non-extended primer signals is generated for the 

15 polynucleotide sequence being analyzed. 

In a preferred embodiment, the array system of the 
invention comprises an array of sequence reagents arrayed on 
a solid support, and comprising (a) a capture moiety which 
can form a stable complex with a region of a template, (b) a 

20 spacer region, and (c) a primer region comprising 3-7 bases 
which, in the presence of a polymerase, can recognize 
complementary sequences in the template and can be extended 
by one or more nucleotides moieties (i.e., nucleotides or 
nucleotide analogues) by a template dependent primer 
extension reaction. Most preferably, the invention is 

25 

practiced using a universal sequencing array, as describe by 
Head, S. et al . (U.S. Patent Application Serial No. 
08/976,427, filed November 21, 1997). 

Preferably, the nucleotide moieties used for template 
dependent primer extension are labeled nucleotides. In 
3° various embodiments, the nucleotide moieties may be, e.g., 
chain terminating nucleotides such as dideoxynucleotides, or 
nucleotides that are not chain terminating such as 
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deoxynucleotides . Preferably, the nucleotide moieties 
comprise nucleotides or nucleotide analogues for all four of 
the nucleotide bases, A, G, C, and T. 

The invention is based, in part, on Applicants' 
5 discovery that nucleic acid sequences can be analyzed on 
solid phase arrays, referred to herein as sequence arrays, 
through the novel analysis of "on/off" polymerase extension 
signals on the arrays. The methods do not require that the 
actual identity of the polymerase extension product be 
determined, merely the determination of whether a polymerase 

10 

extension product is or is not present for each element of 
the array. Subsequences of the template nucleic acid 
molecule can then be identified through simple analysis of 
these on/off polymerase extension signals, and mutations and 
polymorphisms in the template nucleic acid molecule are 

*5 thereby detected. 

The present invention overcomes the limitations in the 
prior art by providing methods for the analysis of nucleic 
acid sequences which employ arrays of primer sequences that 
are three to seven bases in length. Such primer arrays have 

2o no more than 4096 possible sequence combinations (for six 

bases) , and can successfully analyze polynucleotides of up to 
several thousand bases. In one preferred embodiment, such an 
array will have only 1024 sequence combinations (for five 
bases) , and can analyze the sequence of a polynucleotide 
having up to about 1000 bases. In another preferred 

25 

embodiment, such an array will have only 256 sequence 
combinations (for four bases) , and can analyze sequences of 
up to about 3 00 bases. In certain other embodiments, it is 
possible to use arrays having even fewer sequence 
combinations. Such simple arrays are significantly easier to 
30 manufacture and screen than the solid phase arrays typically 
used for de novo sequencing techniques, which comprise 
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millions of sequence combinations and/or employ sequences 
derived empirically from the target sequence. 

The methods of the present invention also provide a 
method for analyzing nucleic acid sequences wherein a 
5 "binary" polymerase extension signal is generated for each 
primer in the array. Thus, signal analysis simply comprises 
detecting the presence or absence of a polymerase extension 
product. Neither identification of the extension product, nor 
discrimination between the different extension products is 
required. 

10 

Signal analysis is further simplified in the present 
invention in that analysis of the resulting signals simply 
comprises analyzing the binary signal pattern or "matrix" of 
the primer array. The pattern is then used to determine the 
presence or absence of a mutation, a polymorphism, or the 

!5 sequence of a target nucleic acid molecule. Thus, the 

methods of the invention utilize a pattern matching approach 
to sequence analysis, thereby reducing or even eliminating 
the need for complicated or intensive methods of analysis. 
The present invention further improves upon the prior 

2o art in that no prior knowledge of the nucleic acid sequence 
to be analyzed is required to practice the methods of the 
invention. The methods of the invention can also be used to 
detect any mutation or polymorphism at any location in a 
nucleic acid molecule, and are not limited to assaying for a 
particular mutation or SNP. Thus, the methods of the present 

25 

invention are not limited to merely determining the identity 
of a nucleotide base at a specific position in a nucleic acid 
molecule. Examples of mutations and polymorphisms which can 
be detected by the present invention include, but are by no 
means limited to, single nucleotide polymorphisms (SNP's), as 
30 well as base deletions, base insertions, and heterozygous 
mutations and polymorphisms. 
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The solid phase arrays used by the methods of the 
invention are simple to produce, and analysis of the 
resulting on/off polymerase extension signals is simple and 
straightforward, reducing the need for complicated, intensive 
5 methods of analysis. Indeed, the PSA methods of the 
invention are ideally suited for simple pattern matching 
methods of analysis. 

r 4 . BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 is a diagrammatic representation of the structure 

10 

of the sequence reagent used in the methods of the present 
invention. 

FIGS. 2A-C illustrate the steps of the polymerase 
signaling assay (PSA) of the present invention; FIG. 2A 
illustrates the "capture" step wherein the capture region of 

!5 the template nucleic acid molecule forms a stable complex 
with the capture moiety of a sequence reagent; FIG. 2B 
illustrates the "scan" step wherein the primer region of the 
sequence reagent recognizes a complementary region of the 
template nucleic acid molecule; FIG. 2C illustrates the 

20 "signal" step wherein the primer is extended by one or more 
. nucleotides , 

FIGS. 3A-D illustrates the expected (FIG. 2A, FIG. 2C) 
and experimental (FIG. 3B, FIG. 3D) data signals from PSA 
analysis of the "wild type" and "mutant" nucleic acid 
sequences shown in Table 1, using a sequence array comprising 
the 21 primer sequences listed in Table 2. 

FIGS. 4A-C illustrates data signal patterns predicted 
for PSA analysis of p53 exon 8; FIG. 4A illustrates the PSA 
data signal pattern for a wild type p53 template; FIG. 4B 
illustrates the PSA data signal pattern of a p53 template 
30 having a single nucleotide polymorphism (SNP; 38C - T) ; FIG. 
4C illustrates the difference between the two data signal 
patterns shown in FIGS. 4A-B. 
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FIGS. 5A-C illustrates predicted data signal patterns 
for PSA analysis of p53 exon 8; FIG. 5A illustrates the data 
signal pattern of a wild type p53 template; FIG. 5B 
illustrates the data signal pattern of a p53 template having 
5 the SNP 38C - A; FIG. 5C illustrates the difference between 
the two data signal patterns shown in FIGS. 5A-B. 

FIGS. 6A-C illustrates predicted data signal patterns 
for PSA analysis of p53 exon 8; FIG. 6A illustrates the data 
signal pattern obtained for a wild type p53 template; FIG. 
6B illustrates the data signal pattern of a p53 template 
having a the heterozygous polymorphism 3 8C - A; FIG. 6C 
illustrates the difference between the two data signal 
patterns shown in FIGS. 6A-B. 

FIGS. 7A-C illustrates predicted data signal patterns 
for PSA analysis of p53 exon 8; FIG. 7A illustrates the data 
15 signal pattern of a wild type p53 template; FIG. 7B 

illustrates the data signal pattern for PSA analysis of a p53 
template having a five base deletion; FIG. 7C illustrates 
the difference between the two data signal patterns shown in 
FIGS. 7A-B. 

20 FIGS. 8A-C illustrates predicted data signal patterns 

for PSA analysis of p53 exon 8; FIG. 8A illustrates the data 
signal pattern obtained for a wild type p53 template; FIG. 
8B illustrates the data signal pattern for PSA analysis of a 
p53 template having a five base insertion; FIG. 8C 

25 illustrates the difference between the two data signal 
patterns shown in FIGS. 6A-B. 

5 . DETAILED DESCRIPTION OF THE INVENTION 
The present invention includes methods of an assay 
^ referred to herein as the polymerase signaling assay or PSA. 
These methods are used for analyzing the sequence of nucleic 
acid molecules. A nucleic acid molecule which is' sequence 
analyzed by the methods of this invention is defined herein 
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as a template nucleic acid molecule, or template molecule, or 
template. Template nucleic acid molecules which may be 
sequence analyzed by the methods of the invention include DNA 
molecules, such as, but by no means limited to, genomic DNA 
5 molecules, cDNA molecules, and fragments thereof. Template 
nucleic acid molecules which may be sequence analyzed by the 
methods of the invention also include RNA molecules, such as, 
but by no means limited to, messenger RNA (mRNA) molecules, 
ribosomal RNA (rRNA) molecules, and fragments thereof. 

10 

The nucleic acid molecule to be analyzed by the PSA 
methods of this invention may be from any source. For 
example, the template nucleic acid molecule may be a 
naturally occurring nucleic acid molecule such as a genomic 
or extragenomic DNA molecule isolated from an organism, or an 

15 RNA molecule, such as an mRNA molecule, isolated from an 

organism. Alternatively, the template nucleic acid molecule 
may be synthesized, including, e.g., nucleic acid molecules 
synthesized enzymatically in vivo or in vitro, such as, for 
example, a cDNA molecule, or a nucleic acid molecule 

20 synthesized by PCR. The sample of nucleic acid molecules can 
comprise, e.g., molecules of deoxyribonucleic acid, 
ribonucleic acid, or copolymers or deoxyribonucleic • acid and 
ribonucleic acid. 

The invention is based, in part, on the discovery by 
applicants that nucleic acid sequences can be analyzed on 
solid phase arrays, referred to herein as sequence arrays, 
through the novel analysis of "on/off* polymerase extension 
signals on the arrays. The methods do not require that the 
actual identity of the polymerase extension product be 
determined, merely the determination of whether a polymerase 

30 extension product is or is not present for each element of 
the array. Subsequences of the template nucleic acid 
molecule can then be identified through simple analysis of 
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these on/off polymerase extension signals, and mutations and 
polymorphisms in the template nucleic acid molecule are 
thereby detected. 

The solid phase arrays used by the methods of the 
5 invention are simple to produce, and analysis of the 

resulting on/off polymerase extension signals is simple and 
straightforward, reducing the need for complicated, intensive 
methods of analysis. Indeed, the PSA methods of the 
invention are ideally suited for simple pattern matching 
methods of analysis. 

10 

The following subsections present the methods of the 
invention in greater detail. In particular, Section 5.2 
first describes the sequence arrays used in the methods of 
the present invention, as well as methods for preparing such 
arrays. The methods of the polymerase signaling assay are 
15 then described, in detail, in Section 5.3. 



5.1. SEQUENCE ARRAYS 
Sequence arrays such as those used in the methods of the 
present invention are described, in detail, by Head, S. et 
20 al. (U.S. Application Serial No. 08/976,427, filed November 
21, 1997) , which is incorporated herein by reference in its 
entirety. Such sequence arrays are also discussed below, as 
they pertain specifically to the PSA methods of the present 
invention. 

25 

The sequence arrays of the present invention comprise 
solid phase arrays of combinatorial oligonucleotides. Such 
arrays comprise a plurality of sequencing reagents 
immobilized, either individually or in a group, to a solid 
surface in a spatially distinct fashion. The sequence 
30 reagents of the sequence arrays each comprise (i) a "capture 
moiety", (ii) a "spacer region", and (iii) a combinatorial 
primer, or "primer region" which is also referred herein as 
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the primer sequence. The methods of the present invention 
utilize sequence arrays wherein the primer of each sequencing 
reagent preferably comprises from four to six bases which can 
recognize a complementary sequence of a template nucleic acid 
5 molecule. 

5.1.1. THE SEQUENCE REAGENT 
The sequence reagents of the sequence arrays used in the 
PSA methods of the present invention are designed for use as 
part of a combinatorial array for primer extension-based 

10 

sequence analysis of template nucleic acid molecules. A 
specific, preferred embodiment of the sequence reagent is 
illustrated in FIG. 1. The individual components of the 
sequence reagent, which are illustrated in FIG. 1, are 
described below. 

15 

Attachment Moietv : 

Under one preferred embodiment, the sequence reagent 
includes an optional "attachment moiety" (FIG. 1; AM) which 
is coupled to one terminus of the sequencing reagent, 
20 preferably the 5 1 -terminus, and functions to attach the 
sequence reagent to the solid surface. Preferably the 
attachment moiety specifically attaches the sequence reagent 
to the solid surface. 

In an alternative embodiment, the sequence reagent is 
non-specif ically attached to the solid surface. The sequence 

25 

reagent can be non-specif ically attached to the solid 
surface, e.g., by means of a cationic agent, such as octyl- 
dimethyl amine HCl or NaCl . Alternatively, the sequence 
reagent can be non-specif ically attached to a charged 
surface, such as an amino modified solid surface. 
30 Under another preferred embodiment, the sequencing 

reagent is specifically attached to the solid surface. 
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Preferably, the specific attachment of the sequence reagent 
is by means of a reversible bond. 

Under one preferred embodiment, the sequencing reagent 
can be specifically attached to the solid surface by means of 
5 a non-covalent bond. For example, a biotin or iminobiotin 
labeled oligonucleotide may be immobilized to an avidin or 
strepavidin coated solid surface. Alternatively, a 
haptenated oligonucleotide may be immobilized to an antibody 
coated solid surface. However, it is to be understood that 
other ligand receptor interactions are suitable for use in 
the present invention. 

Under another preferred embodiment, the sequencing 
reagent is specifically attached to the solid surface by 
means of a covalent bond. Preferably, the covalent bond is a 
disulfide bond. Additional embodiments for attaching the 

15 sequencing reagent to the solid surface are discussed below 
with respect to the capture moiety. The various embodiments 
for immobilizing a nucleotide to a solid surface discussed 
with respect to the capture moiety, described below, can also 
be applied to immobilize the present sequencing nucleotides 

2Q to a solid surface. 

Exemplary attachment moieties suitable for use in the 
present invention therefore include the incorporation of an 
amino, thiol, disulfide, biotin, etc. group at the 5'- 
terminus of the sequence reagent. This modification can be 
done either at the time the sequence reagent is synthesized, 

25 

or after synthesis of the sequence reagent . 

Capture Moiety : 

The "capture moiety" (FIG. 1; CM) of the sequence 
reagent is a moiety which is capable of forming a stable 
30 complex with a region of the template nucleic acid molecule, 
referred to herein as the "capture region" of the template 
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molecule. The capture moiety can be near either the 5' or 3 1 
terminus of the sequence reagent . 

The capture moiety can be, e.g., a DNA, RNA, or PNA 
(protein nucleic acid) sequence. The nucleic acid sequence 
5 may also contain modified bases. For example, a RNA sequence 
may contain 2 * -O-methoxy RNA, and a DNA sequence may contain 
5-Me-dC, pdC, pdU, or 2-amino-dA. Under another embodiment, 
the nucleic acid sequence may contain a modified backbone 
wherein the backbone is modified with phosphorothioate, 
phosphordithioate, methyl posphonate, or H-phosphonate . 

10 

Under another preferred embodiment, the capture moiety 
may be biotin, iminobiotin, avidin, strepavidin, antibody, 
hapten, a receptor, a ligand, or a charged base. Receptors 
and ligands suitable for use in the capture moiety include, 
but are not limited to, protein A, protein G, the Fc portion 

15 of an antibody, or Fc receptor. 

The capture moiety can also form a stable complex with 
the capture region of the template nucleic acid molecule,, 
e.g., by means of a disulfide bond, a covalent ether or 
thioether linkage via an epoxy, UV cross -linkage, a 

20 condensation reaction with a carbodiimide, a 

bromoacetyl/thiol linkage to a thioester, a crosslinkage with 
a bi- functional group, or a complex between thiol and gold. 
Bi-functional crosslinking reagents suitable as capture 
moieties in the present invention include, but are not 

^ limited to, an imidiester, N-hydroxysuccinimidyl ester, 
malemide, alkyl halide, aryl halide, alpha-haloacyl , and 
pyridyl disulfide. The capture moiety can also be covalently 
attached, e.g., by means of a labeling group. Labeling 
groups suitable for use in the present invention include, but 
are not limited to, amino, sulfhydryl, disulfide, phosphate, 

30 thiophosphate, dithiophosphate, and psoralen groups. 

The capture moiety can also exist as a separate molecule 
that is co-attached to the solid phase and effectively brings 
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the template into close proximity to the primer region of the 
sequence reagent (FIG. 1; PR) . 

In one of the preferred embodiments, the capture moiety 
comprises a sequence of 8-24 cytosine bases which can 
5 hybridize to a sequence of 8-24 guanine bases incorporated on 
the template strand. In another preferred embodiment, the 
capture moiety comprises a specific sequence complementary to 
a PCR primer, or portion thereof, used to. amplify a region of 
the template strand. For example, the capture moiety can 
comprise a sequence complementary to a restriction site found 

10 

within the PCR primer. Alternatively, the PCR primer 
hybridizes to a promoter site and thus, the capture sequence 
is the same sequence as a promoter site. PCR primer design 
is well known to those of skill in the art and it is 
appreciated that the capture sequence can be complementary to 
15 a PCR primer, to a portion thereof, or to a modification 
thereof . 

The efficiency of hybridization and the thermal 
stability of hybrids formed between the target nucleic acid 
and a short oligonucleotide probe depends strongly on the 
20 nucleotide sequence of the probe and the stringency of the 
reaction conditions (see, e.g., Conner, B.J. et al . , 1983, 
Proc. Natl. Acad. Sci . U.S.A. 80:278-282). Appropriate 
hybridization conditions for specific embodiments of the 
capture moiety and capture region will therefore be apparent 
to those skilled in the art, in view of the above and other 

25 

references well known in the art. 

Spacer Region : 

The sequence reagent additionally comprises a spacer 
region (FIG. 1; SR) . Preferably, the spacer region is at 
30 least 10 nm in length, more preferably 10-100 nm in length. 
However, the spacer region can also be greater than 100 nm 
length. Spacer regions suitable for use in the present 
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invention include, but are not limited to, DNA or RNA 
sequence, PNA sequence, polyethylene glycol groups, 5-nitro- 
indole* groups, or other chemical spacer arms. The spacer 
region can also consist of analogues of DNA, RNA, and PNA. 
5 In such embodiments, the nucleic acid sequences of the spacer 
region may comprise unmodified or modified nucleotide bases, 
such as the modified bases describe above for the capture 
moiety. Preferably, the spacer region consists of a random 
sequence of bases. However, the spacer region can also 
consist of a pseudo-random or non-random sequence of bases. 

10 

The spacer region is preferably designed to minimize 
template independent noise. Template independent noise is 
the result of signal detection independent (i.e., in the 
absence) of the template molecule. Under one embodiment, a 
spacer region is additionally placed in between the capture 
15 moiety and the attachment moiety. 

Primer Region : 

Finally, the sequence reagent also comprises a primer 
region (FIG. 1; PR) , also referred to herein as the primer. 
20 The primer region consists of specific bases which can 
recognize sequences on the template strand, and can be 
extended by a polymerase with one or more labeled 
nucleotides. In particular, as used in the PSA methods of 
the present invention, the primer consists of four to six 
specific bases which can recognize complementary sequences on 

2 S 

the template strand. Generally, the primer will comprise a 
DNA or RNA sequence four to six of the naturally occurring 
nucleotide bases. Such sequences are also referred to herein 
as the primer sequences of the sequence reagent . 
The primer sequence may be, e.g., an 
30 oligodeoxyribonucleotide, and oligoribonucleotide, or a 

copolymer of deoxyribonucleotides and ribonucleotides. The 
primer sequence can be synthesized either enzymatically in 
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vivo, enzymatically in vitro, or non-enzymatically in vitro, 
e.g., by the methods described in Section 5.1.2 below. 

The primer sequence, or alternatively the sequence 
reagent, may also be labeled with a detectable marker or 
5 label, including the detectable labels described in Section 
5.2.4, below. Preferably the detectable label of marker used 
to label the primer or sequence reagent will be different 
from the labels used to label either the extended primer, or 
the template. 

10 

5.1.2. SYNTHESIS OF A SEQUENCING REAGENT 
There are two preferred methods for making the 
sequencing reagents of a sequence array. The first method is 
to synthesize the specific oligonucleotide sequences directly 
on the solid-phase (in situ) in the desired pattern (i.e., in. 

15 the desired spatially distinct fashion), as described, e.g., 
by Southern et al . (1994, Nucl. Acids Res. 22:1368-1373), by 
Maskos et al. (1992, Nucl. Acids Res. 20:1679-1684), and by 
Pease et al. (1994, Proc. Natl. Acad. Sci. U.S.A. 52:5022- 
5026) . The other method is to first pre-synthesize the 

20 oligonucleotides in an automated DNA synthesizer, such as an 
ABI 392, and to then attach the synthesized oligonucleotides 
onto the solid-phase at specific locations (see, e.g., 
Lamture et al., 1994, Nucl. Acids Res. 22:2121-2125; and 
Smith et al., 1994, Nucl. Acids Res. 22:5456-5465). 

25 In the first method, the efficiency of the coupling step 

of each base will affect the quality and integrity of the 
nucleic acid molecule array. This method generally yields a 
large percentage of undesired incomplete (i.e., shortened) 
sequence which can create problems in the analysis step, and 
thereby effect the integrity of the analysis. Thus, the 

30 

quality and integrity of an array synthesized according to 
the first method is inversely proportional to the length of 
the nucleic acid molecule. Specifically, the synthesis of 
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longer oligonucleotides results in a higher percentage of 
incomplete, shortened sequences. 

The second, more preferred method for nucleic acid array 
synthesis utilizes an automated DNA synthesizer for DNA 
5 synthesis. Oligonucleotides are synthesized using standard 
phosphoramidite chemistry (see, e.g., Matteucci, M.D. et al., 
1981, J. Amer. Chem. Soc. 103:3185-3191). Preferably, a 
segmented synthesis strategy is used to simultaneously 
synthesize large numbers of oligonucleotides (see, e.g., 
Beattie, K.L. et al., 1988, Appl. Biochem. Biotechnol . 
10:510-521; Beattie, K.L. et al., 1991, Nature 352:548-549). 
The controlled chemistry of an automated DNA synthesizer 
allows for the synthesis of longer, higher quality DNA 
molecules than is possible with the first method. Also, the 
nucleic acid molecules synthesized according to the second 

15 

method can be purified prior to the coupling step. 
Therefore, the quality of the nucleic acid molecule array can 
be expected to be much higher than the quality of the nucleic 
acid array of the first method. 

20 5.1.3. IMMOBILIZATION TO A SOLID PHASE 

Several methods have been proposed as suitable for 
immobilizing an oligonucleotide to a solid support. For 
example, Holmstrom, K. et al. (1993, Anal. Biochem. 209:278- 
283) exploit the affinity of biotin for avidin and 

25 streptavidin, and immobilize biotinylated nucleic acid 

molecules to avidin/streptavidin coated supports. Another 
method requires the pre-coating of the polystyrene or glass 
solid phases with poly-L-Lys or poly L-Ly,Phe, followed by 
the covalent attachment of either amino- or sulfhydryl- 
modified oligonucleotides using bi- functional crosslinking 

30 

reagents. Both methods require the use of modified 
oligonucleotides as well as a pretreatment of the solid phase 
(see, e.g., Running, J. A. et al . , 1990, BioTechniques 8:276- 
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277; Newton, C.R. et al., 1993, Nucl . Acids Res. 21:1155- 
1162) . 

Kawai, S. et al. (1993, Anal. Biochem. 209:63-69) 
describes an alternative method in which short 
5 oligonucleotide probes were ligated together to form 
multimers, and these were ligated into a phagemid vector. 
The oligonucleotides were immobilized onto polystyrene plates 
and fixed by UV irradiation at 254 nm. A method for the 
direct covalent attachment of short, 5 ' -phosphorylated 
primers to chemically modified polystyrene plates ("Covalink" 

10 

plates, Nunc) has also been proposed by Rasmussen, S.R. et 
al. (1991, Anal, Biochem. 298:138-142) . The covalent bond 
between the modified oligonucleotide and the solid phase 
surface is created by a condensation reaction with a water- 
soluble carbodiimide . The Rasmussen method claims a 

15 predominantly 5 1 -attachment of the oligonucleotides via their 
5 1 -phosphates; however, it requires the use of specially 
prepared, expensive plates. 

Maskos, U. et al. (1992, Nucl. Acids Res. 20.: 1679-1684) 
describes a method to synthesize oligonucleotides directly 

20 onto a glass support. According to this method, a flexible 
linker with a primary hydroxyl group is bound to the solid 
support via a glycidoxypropyl silane, wherein the primary 
hydroxyl group serves as the starting point for the 
oligonucleotide synthesis. The disadvantages of this method 

25 are that the reaction is not reversible and the 

oligonucleotides leak from the solid surface during 
manipulation. 

Covalent disulfide bonds have been previously used to 
immobilize both proteins and oligonucleotides. For example, 
Carlsson, J. et al. (1991, Biotech. Applied Biochem. 14:114- 
30 120) discloses a method for the reversible immobilization of 
thiol at ed proteins and peptides to an agarose bead by means 
of a disulfide bond. In that method, the disulfide bond is 
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formed between a thiol containing protein and a thiol - 
derivatized agarose bead. The reference also discloses that 
the disulfide bond is reversible in the presence of an excess 
of dithiothreitol . Chu, B.C.F. et al. (1988, Nucl. Acids 
5 Res. 16:3671-3691) discloses a method for coupling 

oligonucleotides to nucleic acids or proteins via cleavable 
disulfide bonds. Prior to the coupling reaction, the 
oligonucleotides are modified by adding a cyst amine group to 
the 5' phosphate by means of a phosphoramadite bond. 
Sliwkowski, M.X. et al . (1983, Biochem. J. 205:731-739) 

10 

discloses a method of covalent chromatography wherein 
proteins are immobilized to cysteinylsuccinimidoproyl glass 
beads through reversible disulfide bond interaction. 

Fahy, E. et al. (1993, Nucl. Acids Res. 22:1819-1826) 
describes the synthesis of 5 1 -bromacetyl and 5' -thiol 
15 oligonucleotide derivatives and the covalent immobilization 
of these oligonucleotide derivatives via thioester bonds to 
sulfhydryl- and bromacetyl -modified polyacrylamide supports. 
The disadvantage of this method is that the covalent bond is 
not reversible. 

20 Anderson et al. (U.S. Serial No. 08/812,010, filed on 

March 5, 1997) describes a hovel method for immobilizing 
nucleic acid molecules to a solid-phase by means of a 
reversible, covalent disulfied bond. In that method, a 
disulfide bond is formed between a thiol or disulfide 

2^ containing nucleic acid molecule and a mercaptosilane coated 
solid surface. Shi .et al . (U.S. Serial No. 08/870,010) 
describes a novel method for immobilizing nucleic acid 
molecules to a solid phase by means of a covalent ether or 
thioether linkage. These simple, two-step methods have the 
specificity and efficiency needed to prepare DNA arrays. 

^ All of the above described methods can be used in the 

present invention to immobilize the sequence reagent to the 
solid support, the preferred embodiments are those disclosed 
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by Anderson et al . (supra), and by Shi et al . (supra). An 
additional preferred embodiment for immobilizing the 
sequencing reagent is to immobilize biotinylated nucleic acid 
molecules to avidin/streptavidin coated supports as disclosed 
5 by Holmstrom, K. et al. (1993, Anal. Biochem. 205:278-283) . 
Although any of a variety of glass or plastic solid 
supports can be used in accordance with the methods of the 
present invention, silicon glass is the preferred solid 
support. Preferably, the solid support is fashioned with 

^ array densities greater than 1000 elements/cm 2 . The support 
can also be fashioned as a bead, dipstick, test tube, pin, 
membrane, channel, capillary tube, column, or as an array of 
pins or glass fibers. Preferably, the plastic support is a 
form of polystyrene plastic. Alternatively, the solid 
support can be glass, preferably in the form of a microscope 

1! > slide, coverslip, a capillary tube, a glass bead, or a 
channel. The solid support can also be a glass plate, a 
quarts wafer, a nylon or nitrocellulose membrane, or a 
silicon wafer. 

20 5.1.4. ARRAY FORMATION 

In the methods of the present invention, the sequence 
reagents are intended to be made into an array. As used 
herein, an array is an orderly, spatially dependent 
arrangement of sequence reagents, as in a matrix of rows and 
columns, or a spatially addressable or separable arrangement 
such as with coated beads. Preferably, the array is an array 
of permutations of the primer sequences, such as all possible 
3mers, 4mers, 5mers, 6mers, 7mers, or combinations thereof. 

With an automated delivery system, such as a Hamilton 
robot or ink- jet printing method, it is possible to form a 

30 very complex array of oligonucleotide primers on a solid 
support, in particular an epoxysilane, mercaptosilane, or 
disulf idesilane coated solid support. Such methods can 
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deliver nano or pico-liter size droplets with , sub-millimeter 
spacing. Because the aqueous beads are extremely well 
defined, it is possible to create an array with an extremely 
high density of oligonucleotide primers. Thus, it is 
5 possible to create arrays having greater than about 10,000 
primer droplets/cm 2 . Such arrays can be assembled through 
the use of a robotic liquid dispenser, such as an ink- jet 
printing device controlled by a piezoelectric droplet 
generator, such that each nucleic acid molecule occupies a 
spot of more than about 10 microns, preferably more than 25 

10 

microns in diameter, and such that each nucleic acid spot is 
spaced no closer, center to center, than the average spot 
diameter. Methods and apparatuses for dispensing small 
amounts of fluids using such ink- jet printing techniques and 
piezoelectric ink- jet depositions have been previously 

15 describe, e.g., by Wallace, D.B. et al. (U.S. Patent No. 

4,812,856) by Hayes, D.J. et al. (U.S. Patent No. 5,053,100). 

Under one. embodiment, the array can be constructed using 
the method for Fodor, S.P. et al. (U.S. Patent No. 
5,445,934). Fodor et al. describes a method for constructing 

20 an array onto a solid surface wherein the surface is covered 
with a photoremovable group. Selected regions of the 
substrate surface are exposed to light so as to activate the 
selected regions. A monomer, which also contains a photo- 
removable group, is provided to the substrate surface to bind 

2^ to the selected area. The process is repeated to create an 
array . 

Under another preferred embodiment, the array can be 
created by means of a "gene pen". A gene pen, as used 
herein, refers to a mechanical apparatus comprising a 
reservoir for a reagent solution connected to a printing tip. 
30 The printing tip further comprises a means for mechanically 
controlling the solution flow. Under one embodiment, a 
multiplicity of gene pens or printing tips may be tightly 
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clustered together into an array, with each tip connected to 
a separate reagent reservoir. Under another embodiment, 
discrete gene pens may be contained in an indexing turntable 
and printed individually. Typically, the solid surface is 
5 pretreated to enable covalent or non-covalent attachment of 
the reagents to the solid surface. Preferably, the printing 
tip is a porous pad. 

Alternatively, the array can be created with a manual 
delivery system, such as a pipetman. Because these arrays 
are created with a manual delivery system, these arrays will 
not, in general, be as complex as those created with an 
automated delivery system. Arrays created with a manual 
delivery system will typically be spaced, center to center, 2 
2 mm apart. Depending on the delivery system employed, it is 
possible to create arrays spaced, center to center, with * 2 
15 mm spacing, 0.5-2 mm spacing, 50-500 jzn\ spacing, or s 50 
spacing. 

5.2. POLYMERASE SIGNALING ASSAY 
The present invention provides methods for analyzing the 

20 sequence of a template nucleic acid molecule; i.e., for the 
detection and/or identification of particular nucleic acid 
sequences and/or subtypes thereof in a sample. Specifically, 
the invention provides methods for an assay, referred to 
herein as the polymerase signaling assay or PSA, for 
analyzing the sequence of a template nucleic acid molecule 
using the sequence arrays described in Section 5.2, above. 
The template nucleic acid molecule to be analyzed by the PSA 
methods of the invention may be from any source. For 
example, the template nucleic acid molecule may comprise a 
naturally occuring nucleic acid molecule such as a genomic or 

30 extragenomic DNA molecule isolated from an organism, or an 
mRNA molecule isolated from an organism. Alternatively, the 
template nucleic acid molecule may comprise a synthesized 
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nucleic acid molecule, include a nucleic acid molecule 
synthesized enzymatically in vivo or in vitro. For example, 
the template nucleic acid molecule may comprise a cDNA 
molecule, or a nucleic acid molecule synthesized by PCR. 
5 The template nucleic acid molecule may additionally be 

labeled with a detectable label, including the detectable 
lables discribed in Section 5.3.4, below. Preferably, the 
detectable lable used to label the template nucleic acid 
molecule will be different from the label used to label the 
nucleotide or nucleotide analog for the primer extension 

10 

reaction, so that the two moieties, i.e., the template 
molecule and the extended primer, can be readily 
distinguished from one another. Likewise, the detectable 
label used to label the template should preferably be 
different and distinct from any label used to label the 

15 primer sequence or the sequence reagent . 

Preferably, the template nucleic acid molecule analyzed 
by the methods of this invention is a single stranded nuclei 
acid molecule, i.e., a single stranded template nucleic acid 
molecule or single stranded template. Accordingly, in 

20 embodiments wherein the initially provided is not single 

stranded, e.g., wherein a double stranded or triple stranded 
template nucleic acid molecule is initially provided, it is 
preferableto first treat the sample containing the template 
nucleic acid molecule so that a single stranded template 
nucleic acid molecule is thereby provided. However, the 
presence of an additional strand or strands does not 
necessarily have an adverse affect upon the methods of the 
invention. Accordingly, in other embodiments the template 
nucleic acid molecule may comprise nonsingle- stranded, e.g., 
double- or triple-stranded, nucleic acid molecules. 

3 0 

The template nucleic acid molecule additionally 
comprises "a capture region, as described in Section 5.2, 
above, for the capture moiety. The capture region is capabl 
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of attaching itself to the capture moiety of the sequence 
reagents. The general steps of the polymerase signaling 
assay are illustrated in FIG. 2A-C. The steps are described 
below, first briefly, and then in detail. 
5 First, during template capture (FIG. 2A) , the template 

nucleic acid molecule is contacted with the sequence reagents 
of the sequence array under conditions such that the capture 
region of the template molecule forms a stable complex with 
the capture moiety of a sequence reagent of the array. 
Second, in the template scanning (FIG. 2B) the template 

10 

' nucleic acid molecule and sequence reagent are incubated 
under conditions such that the primer region of the sequence 
reagent can identify complementary sequences, if any are 
present, in the template nucleic acid molecule. Finally, the 
primer is extended (FIG. 2C) by one or more additional 

15 nucleotides by a template dependent primer extension reaction 
mediated, e.g., by a DNA or RNA polymerase. The added 
nucleotides are detected to determine whether primer 
extension has occurred or not. The above methods are 
repeated for each sequence reagent of the sequence array, and 

20 the resultant pattern of primer extensions is then analyzed, 
e.g., by pattern comparison methods, to analyze the sequence 
of the template nucleic acid molecule. 

It will be apparent to those of skill in the art, that 
in the most preferred embodiment the steps of the invention 
are repeated by contacting a sample comprising a plurality of 

25 ■ 

the template nucleic acid molecule to be analyzed to the 
sequence array such that a template molecule is captured by 
each sequence reagent of the array, and polymerase extension 
is detected simultaneously for each sequence reagent. 
However, for simplicity the methods of the invention are 
30 described in terms of contacting a single template nucleic 
acid molecule to a single sequence reagent of an array. The 
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steps of the PSA method of the invention a discussed 
individually, and in more detail below. 

5.2.1. GENERATION OF SINGLE STRANDED TEMPLATES 
5 Preferably, the nucleic acid template molecules to be 

analyzed by the methods of the present invention will be 
pirovided as single stranded nucleic acid molecules. However, 
in certain embodiments template nucleic acid molecules may be 
provided that are not, initially, single stranded. For 
example, the template nucleic acid molecules to be analyzed 

10 

by PSA may be double stranded, or triple stranded nucleic 
acid molecules. In such instances, the presence of an 
additional strand does not necessarily affect the polymerase 
signal assay of the invention. Thus, the methods of the 
present invention may be practiced on either double-stranded, 

15 or on single stranded DNA obtained, for example, by alkali 
. treatment of native, double stranded DNA. 

Where desired, however, any of a variety of methods can 
be used to eliminate one of the two natural strands of the 
target DNA molecule from the reaction. Single stranded DNA 

20 molecules may be produced using the single -stranded 

bacteriophage M13 (see, e.g., Messing, J. et al . , 1983, Meth. 
Enzymol. 101:20; Samrook, J. et al., 1989, In: Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY) . 

Several alternative methods can be used to generate 

25 

single- stranded DNA molecules. Gyllensten, U. et al. (1988, 
Proc. Natl. Acad. Sci. U. S. A. 85:7652-7656) and Mihovilovic , 
M. et al . (1989, Biotechniques 7:14) describe a method, 
termed "asymmetric PGR", in which the standard PCR method is 
conducted using primers that are present in different 
30 molecular concentrations. Higuchi, R.G. et al . (1985, Nucl . 
Acids Res. 17:5865) exemplifies an additional method for 
generating single-stranded amplification products. The 
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method entails phosphorylating the 5' -terminus of one strand 
of a double- stranded amplification product, and then 
permitting a 5 ■ - 3 ' exonuclease to preferentially degrade 
the phosphorylated strand. 
5 Other methods have also exploited the nuclease resistant 

properties of phosphorothioate derivatives in order to 
generate single-stranded DNA molecules (see, e.g., Benkovic 
et al., U.S. Patent No. 4,521,509; Sayers, J.R. et al., 1988, 
Nucl. Acids Res. 16:791-802; Eckstein, F. et al . , 1976, 
Biochemistry 15:1685-1691; and Ott, J. et al . , 1987, 

10 

Biochemistry 25:8237-8241) . 

Most preferably, such single-stranded molecules will be 

produced using the methods described by Nikiforov, T. (U.S. 

Patent No. 5,518,900). In brief, these methods employ 

nuclease resistant nucleotide derivatives, and incorporate 
15 such derivatives, by chemical synthesis or enzymatic means, 

into primer molecules, or their extension products, in place 

of naturally occurring nucleotides. 

Suitable nucleotide derivatives include derivatives in 

which one or two of the non-bridging oxygen molecules of the 
20 phosphate moiety of a nucleotide has been replaced, e.g., 

with a sulfur- containing group such as a phosphorothioate, an 

alkyl group such as a methyl or ethyl alkyl group, a 

nitrogen- containing group such as an amine, and/or a 

selenium-containing group/ Phosphorothioate 
2 ^ deoxyribonucleotide or ribo-nucleotide derivatives are the 

most preferred nucleotide derivatives. Methods of" producing 

and using such phosphorothioate derivatives are disclosed by 

Nikiforov, T. (U.S. Patent No. 5,518,900). 



5.2.2. TEMPLATE CAPTURE 
30 Once a template nucleic acid molecule and sequence array 

have been provided, the first step of the PSA methods of the 
present invention is to contact the sample containing the 
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template molecule with the sequence array under conditions 
such that the capture region of the template nucleic acid 
molecule forms a stable complex with the capture moiety of a 
sequence reagent. In particular, the capture region of the 
5 template molecule must form a stable complex with the capture 
moiety of the sequence reagent so that the template molecule 
is immobilized to the sequence reagent of the solid-phase 
sequence array. 

The exact conditions necessary will be apparent to one 
of skill in the art for particular embodiments of the capture 

10 

moiety and capture region. For example, in a particularly 
preferred embodiment, the capture moiety is a sequence of 8 
to 24 cytosine bases which hybridizes to a capture region of 
8 to 24 guanine bases incorporated in the template nucleic 
acid molecule. In another preferred embodiment, the capture 

15 moiety can be a specific sequence complementary to a PCR 
primer, or portion thereof, used to amplify a region of the 
template strand. Thus, in such embodiments appropriate 
hybridization conditions would be conditions of high 
stringency, e.g., incubation at a final template 

20 concentration of 100 nM in 1.5 M NaCl at room temperature for 
1 hour, followed by multiple washings in IX TNTw (Tris pH 8, 
Tween 0.5%, 150 mM NaCl). 

In general, the efficiency of hybridization and the 
thermal stability of hybrids formed between such embodiments, 
wherein the capture region and capture moiety comprise a 

2 5 

target nucleic acid sequence and a short oligonucleotide 
primer, respectively, depends strongly on the nucleotide 
sequence of the primer, and on the stringency of the reaction 
conditions as discussed, e.g., in Conner, B.J. et al . , supra. 
Appropriate hybridization conditions for specific embodiments 
30 will therefore be apparent to those skilled in the art in 
view of the above and other references well known in the art. 
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5.2.3. TEMPLATE SCANNING 
After the template nucleic acid molecule has been 
immobilized to the sequence reagent as described above, the 
template -reagent complex is incubated under conditions which 
5 allow duplex formation between the primer sequence of the 
sequence reagent and sequences of the template nucleic acid 
molecule that are complementary to the primer sequence. Such 
conditions generally comprise conditions of stringent 
hybridization, such as the conditions described above for 
hybridization of a target nucleic acid sequence and a short 
oligonucleotide primer or probe. 

Primer sequences as short as 3-7 bases cannot hybridize 
to a template molecule to form stable duplexes. However, 
such primers will form transient duplexes in the presence of 
a polymerase with complementary regions of a template so that 
15 template dependent extension of the primer occurs. Such 
duplexes will only form between the template nucleic acid 
molecule and those sequence specific hybridization regions 
for which the complementary sequence is present, in one or 
more copies, in the template nucleic acid molecule. 

20 

5.2.4. PRIMER EXTENSION 
Preferably, before the primer extension reaction is 
performed, the template is capped by the addition of a 
terminator to the 3 1 -end of the template. The terminator is 
one which is capable of terminating a template -dependent , 

25 

primer extension reaction. The template is thus capped so 
that no additional nucleotides or nucleotide analogs will 
attach to the 3 ' -end of the template. The 3 ' -end of the 
template may be capped, e.g., by a dideoxynucleotide, or by 
other chain terminating nucleotides, such as those described 
30 below, in Section 5.3.4. 

The conditions for the template-dependent primer 
extension reaction require the presence of a suitable 
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template dependent enzyme, such as a DNA or RNA polymerase. 
However, the polymerase must be both primer and template 
dependent. Preferably, the polymerase is also a thermostable 
and/or exonuclease free polymerase. Exemplary DNA 
5 polymerases which may be used in the methods of the invention 
include, for example, E. coli DNA pol I or the "Klenow 
fragment" thereof, T4 DNA polymerase, T7 DNA polymerase 
{i.e., "Sequenase" ) , T. aguaticus DNA polymerase, or a 
retroviral reverse transcriptase. RNA polymerases, such as 
T3 or T7 RNA polymerase, can also be used in certain 

10 

embodiments . 

The conditions for the template-dependent primer 
extension reaction further require the presence of any one or 
more "nucleotide moieties", i.e., one or more of the 
naturally occuring nucleotides or analogs thereto. 

15 Preferably, the duplexes are incubated in the presence of all 
four of the natural deoxynucleoside triphosphates (dNTP's), 
dATP, dCTP, dGTP, and dTTP, or in the presence of analogues 
to all four of the above dNTP's. Alternatively, one or more 
of the nucleotide moieties may comprise a ribonucleotide 

20 triphosphate (rNTP) In one embodiment, the template-primer 
duplex is incubated in the presence of non-extendible or 
chain terminating nucleotides. Exemplary chain terminating 
nucleotides include 2 1 , 3 ' -dideoxynucleotide triphosphate 
derivatives of adenine, thymine cytosine, and guanine. 

In addition to dideoxynucleotides, 3'phosphate modified 
oligonucleotides which are complementary to a template 
nucleic acid molecule and effectively block DNA 
polymerization, can be used in the present invention (see, 
e.g., Kornberg et al., 1992, In: DNA Replication, 2nd 
Edition, Kornberg et al., eds., W.H. Freeman & Co., San 

30 

Francisco, pp. 408, 446-449) . Alternatively, a nucleotide 
analog, such as a fructose based nucleotide analog or a 
chemically modified purine or pyrimidine that retains the 
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ability to specifically base pair with naturally occurring 
nucleotides may be used to block DNA polymerization. A 
variety of 3 1 -substituted nucleotides ( see, e.g., Antrazhev, 
1987, Bioorg. Khim 13:1045-52; Chidgeavadze , Z.G. et al . , 
5 1986, Biochemi. Biophys. Acta 858:145-152; Chidzhacadze et 
al., 1989, Mol. Biol. (Mosk.) 23:1732-1742) such as azido- 
(Mitsuya et al . , 1986, Proc. Natl. Acad. Sci . U.S.A. 
83:1191), mercapto- (Yuzhakov et al., 1992, FEBS Letters 
305:185-188), amino- (Herrein et al . , 1994, Helvetica Chimica 
' Acta 77:586-596), and fluoro- (Chidgeavadze, Z.G. et al . , 
1985, FEBS Letters 183:275-278) substituted nucleotides, 
which have been reported to terminate DNA synthesis, can be 
used in the present invention. 

The nucleotide or nucleotide analog can be detectably 
labeled, preferably with a fluorescent molecule or haptenated 

15 

deoxy- or dideoxynucleotide . Alternatively, the nucleotide 
can be detected, e.g., using delayed extraction MALDI-TOF 
mass spectrometer (see, e.g., Haff, L.A. et al . , 1997, Genome 
Methods 7:378-388). MALDI-TOF mass* spectrometry is capable 
of determining the identity of the incorporated rion- 
20 extendible nucleotide by the change in mass of the extended 
primer. 

The use of f luorescently labeled nucleotides and 
nucleotide analogues in more preferable. Other labels 
suitable for use in the present invention include, but are 
25 not limited to, biotin, iminobiotin, antigens, cofactors, 
dinitrophenol, lipoic acid, olefinic compounds, detectable 
polypeptides, electron rich molecules, enzymes capable of 
generating a detectable signal, and radioactive isotopes. 
The preferred radioactive isotopes include 32 P, 35 S, 14 C, and 
125 I . Fluorescent molecules suitable for the present 

30 

invention include, but are not limited to, fluorescein, 
rhodamine, texas red, FAM, JOE, TAMRA, ROX, HEX, TET, Cy3, 
Cy3.5, Cy5, Cy5.5, IRD40, IRD41, and BODIPY. As used herein, 
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"FAM" refers to 5 1 carboxy- fluorescein, "JOE" refers to 2', 7'- 
dimethoxy-4 1 , 5 ' -dichloro-6-carboxy- fluorescein, " TAMRA" 
refers to N , N , N ' , N 1 - 1 e t ramethy 1 - 6 - carboxy - rhodamine , "ROX" 
refers to 6-carboxy-X-rhdoamine . Electron rich indicator 
5 molecules suitable for the present invention include, but are 
not limited to, ferritin, hemocyanin, and colloidal gold. 
Alternatively, the polypeptide may be indirectly detected by 
specifically complexing a first group to the polypeptide. A 
second group, covalently linked to an indicator molecule, 
which has affinity for the first group could be used to 

10 

detect the polypeptide. In such an embodiment, compounds 
suitable for use as' a first group include, but are not 
limited to, avidin and strepavidin. Compounds suitable for 
use as a second group include, but are not limited to, biotin 
and iminobiotin. 

15 The exact conditions necessary for the template- 

dependent primer extension reaction will be determined by the 
specific polymerase used for the reaction, as well as by the 
choice of nucleotide or nucleotide analogues. These 
conditions for specific embodiments of the primer extension 

2Q reaction will be apparent to those of skill in the art. 

In one preferred embodiment, the template nucleic acid 
molecule is separated from the sequence reagent after the 
extension reaction. This embodiment thus allows the template 
to be recovered, e.g., for further analysis. The template 
molecule is separated from the sequence reagent by means of 
appropriate denaturing conditions that are well known to 
those of skill in the art; e.g., incubating the template- 
reagent complex with heat, alkali, formamide, urea, glyoxal, 
enzymes, and combinations thereof. 

30 5.2.5. DETECTION AND ANALYSIS 

Once polymerase extension products have been permitted 
to form, the extension product is detected for each sequence 
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reagent. For embodiments wherein labeled nucleotides or 
nucleotide analogues are used in the primer extension 
reaction, signal detection consists of simply detecting the 
labeled nucleotide. 
5 For example, in embodiments wherein f luorescently 

labeled nucleotides or nucleotide analogues are used, signal 
detection is accomplished simply by detecting a fluorescent 
signal at the wavelength emitted by the fluorophore. In 
other embodiments, wherein the nucleotide or nucleotide 
analogues are labeled by means of radioactive isotopes, e.g., 

10 

32 P or 35 S labeled dNTP ! s or ddNTP's, extension products may be 
detected by using autoradiography to detect the radioactive 
base. In yet ''other embodiments which use chemical labels 
such as biotin, the labeled nucleotide or nucleotide analog 
may be detected, e.g., by means of a fluorescent probe or dye 

15 such as RPE Protein Dye (Molecular Probe) . 

The results of signal detection for an array of sequence 
reagents, each of which comprises a unique primer sequence, 
can be represented as a digital matrix or pattern of binary 
"on/off" values for each sequence, reagent of the sequence 

20 array wherein either an extension product is -detect ("on"- 
value) or no extension product is detected ( "of f" -value) . 
Each binary signal of the resulting matrix thus indicates 
that are particular primer sequence is ("on") or is not 
("off") present in the template nucleic acid molecule. The 

^ nucleotide sequence of the template molecule is thereby 
represented as a unique digital pattern. 

Such digital data can readily be stored as binary 
patterns in a computer readable format and loaded into 
computer memory for further manipulation or analysis. The 
binary patterns can be readily analyzed by simply comparing 
them, e.g., to the known binary patterns of other wild type 
of mutation nucleotide sequences. Such comparisons can be 
made simply by the manual inspection of such patterns by a 
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user, or by means of a computer algorithm using the binary 
pattern read into computer memory. 

Known binary patterns can be readily obtained, e.g., by 
applying the above methods of PSA to a template nucleic acid 
5 molecule consisting of a known wild type or mutation 

sequence. Alternatively, the binary pattern of a known wild 
type or mutation sequence can be predicted from its nucleic 
acid sequence . 

Sequences of approximately 300-500 bases can be analyzed 
by the above methods using arrays of 1024 unique sequence 

0 

reagent comprising primer sequences of 5 bases with a high 
probability of detecting and characterizing all possible 
mutations within the sequence. Nucleotide sequences of about 
100 bases can be analyzed using 256 element arrays having 
primer sequences four bases in length. Arrays of comprising 
5 primer sequences six bases in length are appropriate for 
characterizing template nucleic acid molecules of about 500- 
1,500 bases. 



6. EXAMPLES 

The examples presented here illustrate the concept and 
practice of the polymerase signaling assay methods of the 
invention. Specifically, the examples first demonstrate the 
application of the methods of the invention to comparatively 
analyzing the sequences of a "wild type" and "mutant" DNA 
oligonucleotide sequence. The examples demonstrate, second, 
the use of the polymerase signaling assay methods of the 
invention to analyze template nucleic acid molecules 
corresponding to an exon of the p53 gene, including template 
nucleic acid molecules corresponding to mutant and 
polymorphic sequences of that gene. 

The examples are presented by way of illustration of the 
previously described invention, and are not limiting of that 
description in any way. 
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6.1. EXAMPLE 1: USE OF PSA TO IDENTIFY SINGLE BASE 
SUBSTITUTIONS IN A SYNTHETIC TEMPLATE 

This example analyzes two synthetic DNA templates that 

vary by a single base using the Polymerase Signaling Assay 

(PSA) method of the present invention on a solid-phase 

sequence array. The results of this example demonstrate that 

the PSA methods of the present invention produce different 

patterns of signals for the two different template sequences 

as predicted. 

10 6.1.1. MATERIALS AND METHODS 

Template Synthesis : 

DNA oligonucleotide sequences which are complementary to 
the "wild type" and "mutant" sequences listed in Table I were 
synthesized using standard techniques of phosphoramidite 

15 chemistry (see, e.g., Matteucci, M.D. et al., supra) as 
templates for PSA analysis. The oligonucleotides were 
synthesized to further contain a 5 ' - (C) 18 tail to provide a 
capture region for attachment of the template to the sequence 
reagents . 

20 

Primer Synthesis : 

The 21 primer sequences shown in Table II were 
synthesized using the same phosphoramidite chemistry 
techniques as were used for template synthesis, describe 
above. The primers were synthesized to additionally contain 

25 a 5 ! - (G) 18 - (N) 7 tail of 2 ' -O-Me RNA nucleotides which 

functioned as a capture, and spacer region for the sequence 
reagent. A thiol group was incorporated at the 5 '-terminus 
of the sequence reagent to function as an attachment moiety 
to secure the reagent to the glass surface of the sequence 

30 array. 

Array Formation : 
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The 21 sequence reagents were printed in a five by five 
array on glass slides, according to the methods described in 
Section 5.2.3 and 5.2.4, above. 

5 Polymerase Signal Assay : 

Each template was attached to the sequence array by 
incubation at a final concentration of 100 nM in a 1.5 M NaCl 
solution at room temperature for 1 hour. Slides were washed 
three times in IX TNTw (Tris pH 8, Tween 0.5%, NaCl 150 mM) 
followed by incubation with an extension mix (described in 

10 

Head, S. et al., 1997, supra) in which all four ddNTP's were 
labeled with Biotin. After 30 minutes at room temperature, 
the extension reaction was terminated by washing the slides 
three times in IX TNTw. 3 00 jxg of RPE Protein Dye (Molecular 
Probes) was incubated on each array for 3 0 minutes, the 
15 arrays were washed and scanned on a Hitachi FMBIO. 

€.1.2. RESULTS 
The purpose of this experiment was to analyze two 
template DNA molecules that vary by a single nucleotide base 
2o using the PSA methods of the present invention. To this end, 
two template deoxyribo-oligonucleotide sequences were 
synthesized having the nucleotide sequences listed in Table 
I. For convenience, the two sequences are referred to herein 
as the "wild type" and "mutant" sequences, respectively. The 
two sequences differed by only one nucleotide base; 

25 

specifically by the substitution of a guanine for cytosirie at 
position 8 of the wild type sequence shown in Table I. This 
single nucleotide polymorphisms is indicated in Table I by 
the underlined base in each of the two sequences. The two 
template sequence also had 5'-(C) 18 tails to. function as the 
30 capture region for attachment to the capture moiety of the 
sequence reagent. 
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TABLE I 



wild type sequence 


5 ' - GTCTCTCCCAGGACAGGCACA- 3 ' 


mutant sequence 


5 ' - GTCTCTCGCAGGACAGGCACA- 3 ' 



5 A sequence array was constructed for sequence analysis 

of each of the two templates according to the methods 
described in Section 5.2, above. The sequence reagents used 
in the array had the schematic structure 

S'S- (G) 1Q - (N) 7 -NXXXXX 

2^q wherein a disulfide bond (S'S) functioned as the attachment 
moiety, a (G) 18 base sequence functioned as the capture 
moiety, and a string of seven mixed bases ( (N) 7 ) functioned 
as the spacer region. 2 1 -O-Me RNA bases were used for the 
nucleotides of both the capture moiety and the spacer region. 

^ The primer sequence of each sequence reagent comprised a 
specific base sequence of four nucleotides which functioned 
as primers to the templates during the assay. The sequence 
reagents were printed in a five by five array on glass 
slides, according to the methods described in Section 5.2, 
above . 

20 Twenty-one primer sequences were used in the assay which 

had signal sequences found in both the wild type and mutant 
templates. Their sequences are listed in Table II, below, 
along with the coordinates of their location in the sequence 
array. Table II further indicates whether primer extension 

25 is predicted for each of the 21 primer sequences, i.e., 
whether the five base primer sequence is present in either 
the wild type or mutant template sequence. 



30 
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TABLE II 



5 



20 



Primer 
Location 


Signal 
Sequence 


Expect Signal 
from Wild Type 


Expect Signal 
from Mutant 


A2 


CCTGC 


NO 


YES 


A3 


TGTGC 


YES 


YES 


A4 


GTGCC 


YES 


YES 


Bl 


TGCCT 


YES 


YES 


B2 


GCCTG 


YES 


YES 


B3 


CCTGT 


YES 


YES 


B4 


CTGCG 


NO 


YES 


B5 


CTGTC 


YES 


YES 


CI 


TGTCC 


NO 


YES 


C2 


CGAGA 


YES 


YES 


C3 


GTCCT 


YES 


YES 


C4 


TCCTG 


YES 


YES 


C5 


CCTGG 


YES 


NO 


Dl 


CTGGG 


YES 


NO 


D2 


TGGGA 


YES 


YES 


D3 


TGCGA 


NO 


YES 


D4 


GGGAG 


YES 


NO 


D5 


GGAGA 


YES 


NO 


E2 


GAGAG 


YES 


YES 


E3 


AGAGA 


YES 


YES 


E4 


GCGAG 


NO 


YES 



The expected data signals for the assay are illustrated 
graphically in FIG. 3A for the wild type template molecule, 
and in FIG. 3C for the mutant template molecule. 

The resultant experimental data patterns from PSA 
analysis of the template molecules are shown in FIGS. 3B and 
3D. FIG. 3B shows the resultant digital matrix from PSA 
analysis of the wild type sequence, while FIG 3D shows the 
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digital matrix obtained from PSA analysis of the mutant 
sequence. The two arrays show distinct signal patterns 
reflective of the sequence of the two different templates. 

5 6.2. EXAMPLE 2 : USE OF PSA TO IDENTIFY MUTATIONS AND 

POLYMORPHISMS IN A P53 GENE 

This example analyzes the 103 base pair sequence of p53 

exon 8 using the PSA methods of the present invention on a 

solid-phase sequence array. The results of this example 

demonstrate that the PSA methods of the invention can 

!0 identify several varieties of genetic mutations and/or 

polymorphisms of an actual gene, including single nucleotide 

polymorphisms, base deletions, base insertions, and 

heterozygote polymorphisms. 

15 The purpose of this experiment was to determine the 

potential applicability of the PSA methods of the invention 
for analyzing template nucleic acid molecules corresponding 
to actual genetic sequences. To this end, the PSA methods 
were used to analyze wild type and mutant sequences of the 

^ 106 base exon 8 of the p53 gene (SEQ ID NO. --) . 

Accordingly, using a computer model, expected binary 
data was generated for PSA analysis of every possible point 
mutation of the p53 exon 8 nucleic acid sequences, thereby 
generating data for 3x106=318 different nucleic acid 
sequences. The predicted digital matrices are illustrated in 

2 5 

FIGS. 4A-6C for certain, specific sequences. 

The results show that about 97% of the mutations could 
be characterized for the p53 exon 8 sequence of approximately 
100 bases using a 256 primer array of 4 base primers. The 
expected digital array generated by the wild type sequence of 
30 p53 exon 8 is illustrated schematically in FIG. 4A, while 

FIG. 4B illustrates the expected digital array generated by a 
mutant sequence wherein the nucleotide 38C has been converted 
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to a T. The difference between the two digital matrices is 
show in FIG. 4C, demonstrating that this single nucleotide 



5 



10 



15 



20 



25 
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polymorphisms generates a unique digital pattern which 
enables it to be distinguished from the wild type sequence by 
the PSA methods of the invention. FIGS. 5A-C show similar 
results for the single nucleotide polymorphism wherein the 
5 nucleotide 3 8C is transverted to an A, demonstrating that 
this sequence also generates an unique digital pattern 
enabling it to be characterized by the PSA methods of the 
invention. 

FIGS. 6A-C illustrate the results of the experiment for 
a p53 exon 8 template sequence having a heterozygous 

10 

polymorphism. The heterozygous template contains two alleles 
of the p53 exon 8 nucleotide sequence. The first allele 
comprises the wild type p53 exon 8 sequence, while the second 
allele comprises the SNP described for FIG. 4B, above; i.e., 
the SNP wherein the base 3 8C is converted to a T. This 

15 heterozygous template produces a unique digital matrix which 
can be readily distinguished from the digital signal produced 
by PSA analysis of the wild type sequence, as illustrated in 
FIG. 6C which shows the difference between the two signals. 
Further, as evidenced by a visual inspection of the two 

20 figures, the signal generated by the heterozygous SNP and 
illustrated in FIG. 6B is also unique from the homozygous 
SNP f s PSA signal pattern illustrated in FIG. 4B. 

Expected binary data was also generated for PSA analysis 
of p53 exon 8 sequences having base deletions, insertions, or 

25 heterozygous mutations. These results are illustrated in 
FIGS. 7A-8C. Specifically, FIG. 7B illustrates the binary 
signal pattern predicted for PSA analysis of a p53 exon 8 
template nucleic acid molecule having a five base deletion. 
The predicted signal for the wild type p53 exon 8 sequence is 
illustrated in FIG. 7A, and their difference is shown in FIG. 

30 

7C. Likewise, FIG. 8B illustrates the binary signal pattern 
predicted for PSA analysis of a p53 exon 8 template 
containing a five base insertion, and FIG. 8C compares this 
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signal with that of the wild type p53 exon 8 sequence, 
illustrated in FIG. 8A. The results show that these 
mutations generate signal which enable them to also be 
uniquely characterized by the PSA methods of the present 
5 invention. 

7. REFERENCES CITED 
All references cited herein are incorporated herein by 
reference in their entirety and for all purposes to the same 
extent as if each individual publication or patent or patent 

10 

application was specifically and individually indicated to be 
incorporated by reference in its entirety for all purposes. 

Many modifications and variations of this invention can 
be made without departing from its spirit and scope, as will 
be apparent to those skilled in the art. The specific 
15 embodiments described herein are offered by way of example 
only, and the invention is to be limited only by the terms of 
the appended claims, along with the full scope of equivalents 
to which such claims are entitled. 



25 



30 
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WHAT IS CLAIMED IS: 

1. A method for analyzing the sequence of a template 
comprising : 

(a) capturing the template; 
5 (b) scanning the captured template using a primer- 

polymerase complex for regions of complementarity 
to the primer; 

(c) extending the primer by one or more nucleotide 
moieties by means of a template -homology dependent 
extension reaction; and 

10 

(d) detecting the extended primer, 

wherein detection of the extended primer indicates the 
presence of one or more regions of complementarity to the 
primer in the captured template. 

15 2. The method of Claim 1 wherein the primer comprises a 
polynucleotide of 3 to 7 bases. 

3 . A method for analyzing the sequence of a template 
nucleic acid according to the methods of Claim 1, wherein the 

20 steps of the method are repeated for an array of primer- 
polymerase complexes so that a pattern of signals is 
generated for the template. 

4 . The method of Claim 3 wherein the array is an array of 
sequence reagents, each sequence reagent comprising: 

25 

(i) a capture moiety; 

(ii) a spacer moiety; and 

(iii) a primer region. 

5. The method of Claim 4 wherein the sequence reagents are 
30 immobilized to a solid surface. 
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6. The method of Claim 5 wherein the solid surface is glass 
or plastic. 

7. The method of Claim 5 wherein the solid surface is a 
glass plate, a quartz wafer, a nylon membrane, a 

5 nitrocellulose membrane, or a silicon wafer. 

8. The method of Claim 5 wherein the solid surface is 
silicon class. 

9. The method of. Claim 5 wherein the solid surface is 
polystyrene plastic. 

10. The method of Claim 4 wherein the sequence reagent 
further comprises an attachment moiety. 

!5 11. The method of Claim 10 wherein the attachment, moiety is 
located at or near the 5' -terminus of the sequence reagent. 

12. The method of Claim 10 wherein the attachment moiety is 
an amino group, a thiol group, a disulfide group, or a biotin 

2 0 group. 

13. The method of Claim 4 wherein the capture moiety is on a 
first reagent and the primer region is on a second reagent. 

14. The method of Claim 13 wherein the first reagent is 

25 

proximal to the second reagent on a solid phase. 

15. The method of Claim 4 wherein the capture moiety 
comprises a sequence of 8-24 cytosine bases. 

30 16. The method of Claim 4 wherein the capture moiety 

comprises a specific sequence complementary to a PCR primer 
or a portion thereof. 



- 45 - 



WO 99/66076 



PCT/US99/13430 



17. The method of Claim 4 wherein the spacer region is at 
least 10 A in length. 

18 . The method of Claim 4 wherein the spacer region 

5 comprises a random, pseudo- random, or non-random sequence of 
nucleotide bases or analogs thereto. 

19. The method of Claim 1 wherein the nucleotide moieties 
are non- chain terminating nucleotides or nucleotide 
analogues . 

10 

20. The method of Claim 19 wherein the nucleotide moieties 
are deoxynucleoside triphosphate bases or ribonucleoside 
triphosphate bases. 

3-5 21. The method of Claim 1 wherein the nucleotide moiety is 
chain terminating nucleotide analogue. 

22. The method of Claim 21 wherein the chain terminating 
nucleotide analogue is a dideoxynucleotide . 

20 

23. The method of Claim 1 wherein the nucleotide moiety is 
detect ably labeled. 

24. The method of Claim 23 wherein the detectable label is 
fluorescent label . 

25 

25. The method of Claim 23 wherein the detectable label is 
radioactive isotope . 

26. The method of Claim 23 wherein the detectable label is 
30 an electron rich molecule. 
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27. The method of Claim 1 wherein the extended primer is 
detected by change in mass. 

28. The method of Claim 4 wherein the density of sequence 
5 reagents in the array is at least 1000 elements/cm 2 . 

29. A sequence array comprising one or more sequence 
reagents in an orderly arrangement wherein each reagent 
comprises : 

(i) a capture moiety which can form a stable 

10 

complex with a region of a template nucleic 
acid molecule; 

(ii) a spacer region; and 

(iii) a primer region, wherein said primer region 
comprises 3-7 bases. 

15 

30. The sequence array of Claim 29 wherein the array 
comprises a set, subset, or combination of 4 3 - 4 7 different 
sequence reagents. 

20 



25 



30 
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